Transformer language models have made tremendous strides in natural language understanding tasks. However, the complexity of natural language makes it challenging to ascertain how accurately these models track the world state underlying the text. Motivated by this issue, we consider the task of language modeling for the game of chess. Unlike natural language, chess notations describe a simple, constrained, and deterministic domain. Moreover, we observe that an appropriate choice of chess notation allows the world state to be probed directly, without any additional probing machinery. We find that: (a) with enough training data, transformer language models can learn to track pieces and predict legal moves with high accuracy when trained solely on move sequences; (b) for small training sets, providing access to board state information during training yields significant improvements; (c) the success of transformer language models depends on access to the entire game history, i.e., "full attention"; approximating this full attention results in a significant performance drop. We propose this testbed as a benchmark for future work on the development and analysis of transformer language models.
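The setup lends itself to a short illustration. The sketch below uses the python-chess library; the helper functions, the toy game, and the legality metric are illustrative assumptions rather than the authors' actual code. It shows why UCI-style notation makes the world state directly probeable: each move token names both its source and target squares, so predicting a move given its source square already requires knowing which piece sits there, and a predicted move can be checked for legality against the true board.

```python
# A minimal sketch of the chess language-modeling testbed described above,
# built on the python-chess library. Function names, the toy game, and the
# legality metric are illustrative assumptions, not the authors' code.
import chess

def game_to_uci_tokens(san_moves):
    """Replay a game given in SAN and return its UCI token sequence.

    UCI tokens (e.g. 'e2e4') name both source and target squares, so a
    language model trained on them can be probed for piece tracking
    directly, with no extra probing classifier.
    """
    board = chess.Board()
    tokens = []
    for san in san_moves:
        move = board.parse_san(san)  # raises ValueError on an invalid move
        tokens.append(move.uci())
        board.push(move)
    return tokens, board

def legal_move_accuracy(board, predicted_ucis):
    """Fraction of predicted UCI moves that are legal in the given position."""
    legal = {m.uci() for m in board.legal_moves}
    return sum(u in legal for u in predicted_ucis) / max(len(predicted_ucis), 1)

# Encode the opening of a game as a training sequence for the language model.
tokens, board = game_to_uci_tokens(["e4", "e5", "Nf3", "Nc6", "Bb5"])
print(" ".join(tokens))  # -> e2e4 e7e5 g1f3 b8c6 f1b5

# Score some hypothetical model continuations: e5e4 is illegal here.
print(legal_move_accuracy(board, ["a7a6", "g8f6", "e5e4"]))  # -> 0.666...
```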
- Toshniwal, Shubham; Wiseman, Sam; Ettinger, Allyson; Livescu, Karen; Gimpel, Kevin. Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP).
- Srivastava, Aarohi; Rastogi, Abhinav; Rao, Abhishek; Shoeb, Abu Awal; Abid, Abubakar; Fisch, Adam; Brown, Adam R.; Santoro, Adam; Gupta, Aditya; Garriga-Alonso, Adrià; et al. Transactions on Machine Learning Research.